On the Complexity of Optimizing Pagerank
نویسندگان
چکیده
EXTENDED ABSTRACT. In search engines, it is critical to be able to compare webpages according to their relative importance, with as few as possible computational resources. This is done by computing the PageRank of every webpage from the web [4] (i.e., the average portion of time spent in the webpage during an infinite and uniform random walk on the web). Pages with higher PageRank will then appear higher in the list of results. See [3] for a survey on PageRank and its applications. The concept of PageRank has generated a large number of questions and challenges. Among these, the problem of optimizing the PageRank of webpages raises increasing interest, as evidenced by the growing literature on the subject [1, 20, 7, 5, 17, 6, 10]. In the PageRank Optimization problem (PRO) that we study, one aims to maximize (or minimize) the PageRank of some target node when control is granted on some subset of free edges that may be chosen to be activated or deactivated. A typical application of PRO is the so-called webmaster problem in which a webmaster tries to maximize the PageRank of one of his webpage by determining which links under his control (i.e. on his website, or on an allied website for instance) he should activate and which links he should not [1, 7]. The same tools may be used to find how much the PageRank of some nodes can vary when the presence or absence of some links is uncertain (e.g. because a link is broken, the server is down or because of traffic problems) [17]. Similar situations also exist in economic networks in which agents choose partners in order to increase their centrality (i.e., their PageRank) [22] or decrease the centrality of other agents. For instance, a government might want to prevent a bank from acquiring excessive influence in order to limit the sensitivity of the bank network to a possible bankruptcy, and it should therefore allow or reject some transactions [2, 8, 11, 19]. Csáji et al. proposed a way of modeling PRO as a Markov Decision Process (MDP), thereby showing that an exact solution of the problem could be found in weakly polynomial time using linear programming [6]. (For more on MDP see, e.g., [21].) Yet in practice, MDPs are solved much more efficiently using algorithms adapted to their special structure. Among these algorithms, Policy Iteration (PI) [16] performs very well in practice and is guaranteed to converge to the optimal solution in a finite number of iterations. However, even though PI usually converges in a few iterations, its actual complexity remains unclear.
منابع مشابه
Policy Iteration is well suited to optimize PageRank
The question of knowing whether the policy Iteration algorithm (PI) for solving Markov Decision Processes (MDPs) has exponential or (strongly) polynomial complexity has attracted much attention in the last 50 years. Recently, Fearnley proposed an example on which PI needs an exponential number of iterations to converge. Though, it has been observed that Fearnley’s example leaves open the possib...
متن کاملThe Evaluation of the Team Performance of MLB Applying PageRank Algorithm
Background. There is a weakness that the win-loss ranking model in the MLB now is calculated based on the result of a win-loss game, so we assume that a ranking system considering the opponent’s team performance is necessary. Objectives. This study aims to suggest the PageRank algorithm to complement the problem with ranking calculated with winning ratio in calculating team ranking of US MLB. ...
متن کاملA Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification
In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...
متن کاملOptimal Operation of a Three-Product Dividing-Wall Column with Self-Optimizing Control Structure Design
This paper deals with optimal operation of a three-product Dividing-Wall Column (DWC). The main idea is to design a control structure, through a systematic procedure for plantwide control, with an objective to achieve desired product purities with the minimum use of energy. Exact local method is used to find the best controlled variables as single measurement or combination of measurements ...
متن کاملOptimal Operation of a DWC by Self-Optimizing Control: Active Vapor Split Approach
Dividing Wall Column(DWC) offers the large potential for operating and capital cost saving in compared with conventional distillation sequence. In the studied DWC in this study, the aid of Vmin diagrams, it is shown that without a suitable value for vapor split fraction bellow the dividing wall in different operating conditions, the energy requirement increases from optimal value and it wil...
متن کامل